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We consider the optimal design of a scheduling policy for adaptive video streaming in wireless 
"Small-Cells" networks. We formulate the problem as a network utility maximization, and we observe 
that it naturally decomposes into two subproblems: admission control and transmission scheduling. 
The resulting algorithms are simple and suitable for distributed implementation. The admission control 
decisions involve each wireless user choosing the quality of the video "chunk" asked for download, 
^ ' based on the local network congestion in its neighborhood. This is compatible with the current video 

OO ' streaming technology based on the DASH protocol over TCP connections. The transmission scheduling 

o : 

00 , policy takes on the form of a local maximization of the weighted sum rate at each small cell base station, 
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CO ' delay with which the video chunks are delivered, such that each user can decide its pre-buffering time at 

the beginning of a streaming session or re-buffering time in the case of a "stall" event (empty playback 

. . , buffer) during a streaming session. Through simulations, we evaluate the performance of the proposed 

rS ■ 
^ . algorithm under realistic assumptions for a small-cell wireless network. Our results show that the scheme 



C/3 



C^ 



is robust to arbitrary (per-sample path) network state variations due, for example, to random streaming 
activity and user mobility. 
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I. Introduction 

We consider the problem of joint transmission scheduling and congestion control for adaptive video 
streaming in a wireless network formed by number of small cell base stations serving wireless users 
over a given geographic coverage area |IT], ||2]- We formulate a Network Utility Maximization (NUM) 
problem in the framework of Lyapunov Optimization |l3], and derive algorithms for joint transmission 
scheduling and congestion control inspired by the drift plus penalty approach ||3l. 

Excellent surveys on various problem formulations in the NUM framework can be found in ||4]-||6]. 
Initial work in NUM focused on networks with static connectivity and time-invariant channels. One of 
the first applications of NUM was to show that Internet congestion control in TCP implicitly solves a 
NUM problem where the variant of TCP dictates the exact shape of the utility function IH, [61. This 
framework has been extended to wireless ad-hoc networks with time varying channel conditions (see 
Q, m and the references therein) and to peer-to-peer networks in Q. Recent work on adaptive video 
scheduling in wireless networks appears in lITOl . lITTI . 

Video streaming differs from file downloading since the duration of the download is much longer than 
the delay with which the playback begins with respect to the time at which the request is made. Hence, the 
video file is divided into segments referred to as "chunks", typically of video playback duration ss 0.5s. 
These chunks are requested sequentially, and the video quality of such segments can be dynamically 
adjusted depending on the local channel conditions (e.g., cell congestion, fading, pathloss changes due to 
mobility). After an initial pre -buffering delay, where the requesting user fills its playback buffer with an 
ordered sequence of chunks, the playback starts while the user is still requesting the remaining chunks. 
The playback process consumes the chunks at fixed playback rate (e.g., one chunk per 0.5s). In contrast, 
the number of ordered chunks per unit time entering the playback buffer is a random variable, due to 
the fact that the transmission resources are dynamically allocated by the scheduling policy, and that 
the network state (to be formally defined later on) may be time -varying. Notice that chunks must be 
ordered sequentially in order to be useful for video playback. If chunks go through different queues in 
the network and are affected by different delays, it may happen that already received chunks with higher 
order number cannot be used for playback until the missing chunks with lower order number are also 
received. Hence, the playback buffer input process is dominated by the largest delay with which the 
requested chunks are delivered. The pre -buffering time should be adaptively determined on the basis of 
an estimate of such largest delay, for each streaming user. We define the "stall" event as the event that 
a chunk is not in the buffer when it is needed for playback. In this case, several practical options are 



possible. For example, the playback may jump to the chunk of lowest order number available in the 
buffer. Alternatively, the playback may stop until the missing chunk is received. Also, when a stall event 
occurs, the user may decide to stop playback and dedicate some time to re-buffering, i.e., building up a 
sufficiently large number of ordered chunks in the buffer, before re-starting the playback. 

In our network model, wireless users place video streaming requests and wish to download sequences 
of video chunks, corresponding to the desired video files, from neighboring dedicated nodes referred to 
as helpers. User requests are served in a single-hop transmission (we do not consider multi-hop relaying 
in this work). The helpers may be small cell base stations |l2l, connected to some video content delivery 
network (CDN) through some wired backbone, or dedicated wireless video streaming nodes with local 
caching capacity, disseminated in the coverage area for the specific purpose of creating a "wireless 
CDN" |[T2l . |[T3l . Also, the helpers may be other user terminals with cached video files, operating in 
device-to-device mode 191, |[T4l . 

For the problem at hand, as a consequence of the NUM formulation, we obtain an elegant decomposition 
of the optimal solution into two separate subproblems, namely, transmission scheduling and admission 
control, which interact through the queue lengths. The subproblems are solved by distributed dynamic 
policies requiring only local queue length information at each network node. In particular, the transmission 
scheduling decisions are carried out in a decentralized fashion by the helpers, while the admission 
control decisions are carried out in a decentralized fashion by the users. It is interesting to notice that 
the independent choices made by every user in deciding the quality of the video chunk to request for 
download at every given time is compatible with the current technology based on client-driven Dynamic 
Adaptive Streaming over HTTP (DASH) for video on demand (VoD) systems |[T5l . |[T6l . 

Motivated by realistic system parameters, we assume that the time and frequency selective wireless 
channel fading coherence time x bandwidth product is small with respect to the number of signal 
dimensions spanned by the transmission of a video chunk (see Section JI] for typical values). This implies 
that the rate scheduling decisions in the transmission scheduling policy can be based on the ergodic 
achievable rate region of the underlying physical layer, where averaging is with respect to the small-scale 
channel fading, conditionally on the pathloss coefficients. This approach has the advantage of decoupling 
the scheduling decisions at the chunk level from the fine-grain physical layer details, such as power 
control and adaptive coding and modulation, which may be implemented at the physical layer at the 
time scale of the transmission time-frequency slots, independently and transparently with respect to the 
proposed scheduling scheme. In contrast, other system parameters such as the distance dependent path 
loss or the user activity (alternating idle/streaming state) may evolve at the same time scale of the video 



chunks, yielding non-ergodic dynamics. For example, a typical YouTube video streaming session spans 
between 4 to 10 min, which is sufficient for a user moving at walking speed to significantly change its 
relative distances with respect to surrounding helpers. Also, we have to consider that a coded video file 
is formed by chunks whose statistics may change with time. Correspondingly, variable bit rate (VBR) 
video coding || T7| yields a rate-quality tradeoff function that may change with time (i.e., with the chunk 
index). We address these non-stationary and non-ergodic dynamics of the system parameters by providing 
performance guarantees that hold for arbitrary sample paths, using the approach developed in ||3l, lITSl . 
The rest of this paper is organized as follows. In Section JIJ we describe our network and requests 
model for video streaming in a small-cell heterogeneous wireless network scenario, and discuss the key 
underlying assumptions. In Section |llll we formulate the problem in the NUM framework and derive a 
distributed dynamic scheduling algorithm for its solution using the drift plus penalty approach. In Section 
UVl we provide performance guarantees for any arbitrary sample path evolution of the network state and 
recover as a special case the corresponding guarantees when the network state is assumed to vary in an 
i.i.d manner. Section IV] illustrates our proposed policy for adaptive pre-buffering and re-buffering that is 
able to cope with the fact that, since different chunks of the same video streaming sessions go through 
different helpers transmission queues, the sequence of chunks received at the playback buffer may be 
out of order and some chunks may be missing at the required playback time. Finally, a realistic system 
simulation scenario is provided in Section |Vl] We investigate the effect of the key parameters regulating 
the proposed scheduling policy and affecting the tradeoffs of the relevant performance metrics, such as 
pre-buffering time, fraction of time spent in re-buffering mode, and the quality levels of the downloaded 
chunks. Our results show that the proposed algorithm dynamically adapts its decisions in response to 
variations in large scale path-loss coefficients due to mobility, to variations due to user activity (i.e., when 
users alternate idle times of random duration between successive streaming sessions), and to variations 
of the source coding rate across video chunks due to VBR video coding. 

II. System Model 

We consider a discrete, time-slotted wireless network with multiple user stations and multiple helper 
stations. The network is defined by a bipartite graph Q = {1{,T-L,£), where U denotes the set of users, 
Ti denotes the set of helpers, and £ contains edges for all pairs {h, u) such that there exists a potential 
transmission link between h G T-L and u gU. We denote by J\f{u) C T-[ the neighborhood of user u, i.e., 
M{u) = {he'H:{h,u) e £}. Similarly, M{h) = {u eU : {h,u) £ £}. 

Each user u gU requests a video file /„ from a library of possible files. As said before, a video file is 



formed by a sequence of chunks (e.g., corresponding to a group of pictures (GOP) that are encoded and 
decoded as stand-alone units |[T6l ). Chunks have a fixed duration, given by Tgop = (# frames per GOP)/r/, 
where 77 is the frame rate (frames per second), and must be reproduced in sequence at the user end. The 
streaming process consists of transferring chunks from the helpers to the requesting users such that 
the playback buffer at each user receiver contains the required chunks at the beginning of each chunk 
playback time. The playback starts after a short pre-buffering time, where the playback buffer is filled in 
by a determined amount of ordered chunks. We assume that the scheduler time-scale coincides with the 
chunk interval, i.e., at each chunk interval a scheduling decision is made. Conventionally, we assume a 
slotted time axis t = 0, 1, 2, 3 . . . , corresponding to epochs t x Tgop. Letting T^ denote the pre-buffering 
delay of user u, the chunks are downloaded starting at time t = and playback starts at time T^. A stall 
event for user u at time t > T^ is defined as the event that the playback buffer does not contain chunk 
number t — T^ at slot time t. 

The helpers may not have access to the whole video library. For example, if helpers are connected 
to the core network through a capacity limited wireless link, then it is convenient to cache locally a 
subset of the possible files and refresh the caches at much slower rate during the traffic off -peaks |ll2]. 
In general, we denote by 'H(/) the set of helpers that contain file /. Hence, the request of user u 
for a chunk at a particular slot t can be assigned to any one of the helpers in the set J\f{u) r\'H{fu)- 
Letting iVpix denote the number of pixels per frame, a chunk contains k = 'qTgopNpix pixels (source 
symbols). We assume that each chunk of file / is encoded at a finite number of different quality modes 
m € {1, . . . ,Nf}. This is similar to what is currently implemented in several recent video streaming 
technologies Uke Microsoft Smooth Streaming and Apple HTTP Live Streaming |[T9l . Due to the variable 
bit-rate nature of video coding, the quality -rate profile may vary from chunk to chunk. We let Df{m, t) 
and kBf{m, t) denote the video quality measure (e.g., see EOl ) and the number of bits for file / at chunk 
time t and quality mode m respectively. A fundamental function of the network controller at every slot 
time t consists of choosing the quality mode ?n„(t) for all requesting users u. The choice mu{t) renders 
the choice of the point {Df^^ {mu{t),t), kBf^^ {mu{t),t)) from the finite set of quality-rate tradeoff points 
{{D f^{m,t) ,kB f^{m,t))} ^^^. We let Rhu{t) denote the source coding rate (bit per pixel) of chunk t 
received by user u from helper h. In addition to choosing the quality mode mu{t) for chunk time t for 
all requesting users u, the network controller also allocates the source coding rates Rhu{t) satisfying 

Y^ Rhu{t) = BfSmu{t),t), \J{h,u)e£. (1) 

When Rhu{t) is determined, helper h places the corresponding kRhu{t) bits in its transmission queue 



Qhu, to be sent to user u within the queuing and transmission delays. Notice that in order to be able to 
download different parts of the same chunk from different helpers, the network controller needs to ensure 
that all received bits from the serving helpers J\f{u)n'H{fu) are useful, i.e., the union of all received bits 
spans the base layer up, without overlaps and without gaps. Alternatively, each chunk can be encoded 
by intra-session Random Linear Network Coding |[2T1 such that as long as kBf^{mu{t),t) parity bits 
are collected at user u, the t-th chunk can be decoded and it becomes available in the user playback 
buffer. Interestingly, our solution derived in Section |III] yields that even though we allow the possibility 
of downloading different bits of the same chunk from different helpers, the optimal scheduling policy 
is such that users download only entire chunks from single helpers, rather than obtaining different parts 
from different helpers. Thus, it turns out that the assumption of protocol coordination or linear network 
coding to prevent overlap or gaps of the downloaded bits from different helpers is not needed. 
The dynamics of the transmission queues at the helpers is given by: 

Qhu{t + 1) = max{Qhu{t) - nnhuit), 0} + kRhu{t), V {h, u) € £, (2) 

where n denotes the number of physical layer channel symbols corresponding to the duration Tgop, and 
fJ'hui't) is the channel coding rate (bits/channel symbol) of the transmission from helper h to user u at 
time t. 

We model the point-to-point wireless channel for each {h, n) G <S as a frequency and time selective 
underspread ||22| fading channel. Using OFDM, the channel can be converted into a set of parallel 
narrowband sub-channels in the frequency domain (subcarriers), each of which is time-selective with a 
certain fading channel coherence time. We assume the widely adopted block fading model, where the 
small scale Rayleigh fading coefficient is constant over time-frequency "tiles" spanning blocks of adjacent 
subcarriers in the frequency domain and blocks of OFDM symbols in the time domain. For example, in the 
LTE 4G standard |[23l . the small scale fading coefficients can be considered as constant over a coherence 
time interval of 0.5 ms (7 OFDM symbols) and a coherence bandwidth of 12 sub-carriers, corresponding 
to a bandwidth of 12 x 15kHz = 180kHz for a carrier spacing of 15kHz. Such tiles are referred to as 
resource blocks. For a total system available bandwidth of 18MHz (after excluding the guard bands) and 
a scheduling slot of duration Tgop = 0.5s (typical GOP duration), we have that a scheduling slot spans 
5-io-^^^xi80-io3 ~ ^^^ resource blocks, each of which is affected by its own fading coefficient. Even 
assuming some correlation between fading coefficients, it is apparent that the time-frequency diversity 
experienced in the transmission of a chunk is very large. Thus, it is safe to assume that channel coding 
over such a large number of resource blocks can achieve the ergodic capacity of the underlying fading 
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channel. [J For simplicity, in this paper we consider constant power transmission, i.e., the serving helpers 
transmit with constant and flat power spectral density over the whole system bandwidth, irrespective of 
the scheduling decisions and of the instantaneous fading channel statejj We further assume that every 
user u when decoding a transmission from a particular helper h € 7V^(u) treats the interference from 
other helpers as noise. Under these system assumptions, the maximum achievable rate at slot time t for 
link (/i, n) € £" is given by 

PhghuitMahul"^ 



Chu{t) = E 



log 1 + 



(3) 



^ + zZh'TLhPh'9h'uV')\'^h'u\ ^ 

where Ph is the transmit power of helper h, a^u is the small-scale fading gain from helper h to user u 
and ghu{t) is the slow fading gain (pathloss) from helper h to user u. 

We assume that each helper h serves its neighboring users u € M{h) using orthogonal FDMA/TDMA. 
Therefore, the set of rates {iJ,hu{t) ■ u £ ■^(^)} is constrained to be in the "time-sharing region" of the 
broadcast channel formed by helper h and its neighbors J\f{h). This yields the transmission rate constraint 

Also note that in practice, for a dense deployment of helpers, each helper h serves only a small number of 
users, i.e., \M{h)\ is small and therefore scheduling at the physical layer for the sake of achieving multi 
user diversity gains does not provide significant improvements in terms of ergodic rates. Therefore, for 
simplicity we neglect such a possibility and use the above ergodic rate expression that assumes to schedule 
users independently of their instantaneous small-scale fading channel realizations, thereby avoiding the 
need for fast channel state information feedback. It is however assumed that some form of slow channel 
state information feedback exists, consistent with rate adaptation schemes currently implemented in WiFi 
and LTE [23.1 . This allows the helpers to adapt their rate such that the rate vector allocated by each helper 
h G T-L to its served users u € M{h) is inside the (generally time-varying) achievable rate region ^ at 
each time t. 

'in this paper we refer to ergodic capacity as the average mutual information resulting from Gaussian i.i.d. inputs of the 
single-user channel from helper h and user u £ J^{h), while treating the signals of all other helpers h' j^ h as noise, where 
averaging is with respect to the first-order distribution of the small-scale fading, conditionally on the set of slow fading gains. 
This rate is achievable by i.i.d. Gaussian coding ensembles and approachable in practice by modem graph-based codes provided 
that the length of a codeword spans a very large number of independent small-scale fading states 1241 . 

^This assumption is consistent to a WiFi offloading system, where the helpers coincides with the WiFi access points, and 
operate at fixed power and over the whole system bandwidth. 



The slow fading gain ghu{t) models path loss and shadowing between helper h and user u, and it is 
assumed to change slowly in time. For a scenario typical of small cell networks, where users are nomadic 
(e.g., moving at walking speed), the slow fading coefficients change on a time-scale of the order of 10s 
(i.e., « 20 scheduling slots). This time scale is much slower than the coherence of the small-scale fading, 
but it is comparable with the duration of the video chunks. Therefore, variations of these coefficients 
during a streaming session (e.g., due to user mobility) are relevant. We let uj{t) denote the network state 
at time t, i.e.. 

Let ^(^(i) be the set of feasible control actions, dependent on the current network state uj{t), and let 
a{t) € A^{t) be a control action, comprising the vectors R(t) with elements kR^uit) of video coded 
bits, /x(f) with elements nfihu{t) of channel coded bits and mu{t) \/ u ^ U. A control policy for the 
system at hand is a sequence of control actions {a(t)}^g where at each time t, a{t) € ^a;(t)- 

III. Problem Formulation and Control Policy Design 

In the proposed NUM problem, the goal consists of designing a control policy which maximizes 
a concave utility function of the time averaged video qualities of the users, subject to keeping the 
transmission queues at every helper stable. Throughout this work, we use the following notation for the 
long-term time average expectation of any quantity x: 

1 *"^ 
X := l\m-Y,^[x{r)] (5) 

Define D^ ■= linit^oo j St=o ^ I^A. ("^mI''"), t)] as the long-term time average of the expected quality 
of user u, and Q^^ := Iimt_i.oo j X]t=o ^ IQhu (''")] as the long-term time average of the expected length 
of the queue at helper h for data transmission to user u assuming temporarily that these limits exist |j. 
Let (/)«(•) be a concave, continuous, and non-decreasing function defining utility vs. video quality for 
user u gU. The goal is to solve: 

maximize y^(f)u{Du) (6) 

subject to Qf^^ < CO y {h,u) e £ (7) 

a{t) G A^^t) V t, (8) 

^ The existence of these limits is assumed temporarily for ease of exposition of the optimization problem (|6} - ([8} but is not 
required for the analysis in Section HV] 



where constraint ^ corresponds to the strong stability condition for all the queues Qhu- 

Problem dill - ([8]l can be solved using the stochastic optimization theory of Q. Since it involves 
maximizing a function of time averages, it is convenient to transform it into an equivalent problem that 
involves maximizing a single time average instead of a function of time averages. Then, the drift plus 
penalty framework of |]3] can be applied. This transformation is achieved through the use of auxiliary 
variables 7u(i) and corresponding virtual queues 6u(t) with buffer evolution: 

e«(t + l)=max{e„(i)+7«(i)-^/„("i«(i),i),0}. (9) 

Consider the transformed problem: 

^JJ^ (10) 



maxunize 



subject to (5/j„ < oo V (/i, n) G (5 (11) 

7« < ^« V n G U (12) 

Df"" < luit) < D^^ yu € U (13) 

a{t) e ^^(t) V t (14) 

where W^^^ is a uniform upper bound on the maximum quality Dj^ {^f^ 1 1) and -D™"^ is a lower bound on 
the minimum quality Df^{l,t), for all chunk times t. Notice that constraints (IT2l) correspond to stability 
of the virtual queues 0^, since 7„ and Du are the time-averaged arrival rate and the time-averaged 
service rate for the virtual queue given in (0. We have: 

Lemma 1: Problems Q - ([8]) and (ITOl ) - (fT4l ) are equivalent. 

Proof: Let (p°^^ and (p^^^ be the optimal solutions of problems © - dS) and ([TOll - (Ell, respectively. 
Let a*{t) be a feasible policjO that achieves c/jg'' . Since 0u(-) is concave for all u G U, by Jensen's 
inequality, we have 

ueu ueu 

and since the policy a*{t) satisfies the constraint (fTZi i. (pu{-) is non-decreasing for all u gU, we further 

have 

^MD*)>^<Pu6Z)- (16) 

ueu ueu 

Note that there is a shght abuse of notation here because we now assume that the poMcy a*(i) also includes the decisions 
7*(i) — (7i (i), . . . ,7|t(|(i)) in addition to the decisions R*(i), /^'(t) and m*(t) for all u G W, as indicated at the end of the 
previous section. 
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Notice that since a*{t) is feasible for the transfomied problem (ITO^ - (IT4l . then it also satisfies the 
constraints of the original problem (|6) - (|8j and therefore it is feasible for this problem. Thus we 
conclude that: 

</'ipt>E'^«(^)^'^opf (17) 

ueu 

Now let a (t), comprising of the decisions R (t), /i and m^{t) for all u G U, he an optimal poicy 
for the original problem, achieving (f>°^ . Since a (t) satisfies the constraints d?]), dSjl, it also satisfies the 
constraints (ITTI ). (IT4] | of the transformed problem. Further, we choose 7 (t) = D' for all time t. Such 
choice of 7' (t) together with the policy a (t) forms a feasible policy for the transformed problem (ITO) 
- (El). Therefore: 

^T > E ^^^) = E '^"TO = -^r- (18) 

Thus, from dTTl ) and dTSl ). we have 0°'' = 02^ and by comparing the constraint it is immediate to 
conclude that an optimal policy for the transformed problem can be directly turned into an optimal 
policy for the original problem. ■ 

Let Q(t) denote the column vector containing the backlogs of queues Qhu V {h, u) £ £ at time t and 
let 0(t) denote an analogous column vector for the virtual queues Qu"^ u ^U. Also let ■y{t) denote the 
vector with elements ^u{t) y u ^U and D(t) denote the vector with elements Df^{mu{t),t) \/ u £U. 
The drift plus penalty (DPP) policy which solves the problem ( fTOl ) - (fT4l ) is obtained as follows. Let 
G{t) = [Q^(t), 0^(t)] be the combined vector of queue backlog vectors and define the quadratic 
Lyapunov function L(G(t)) := ^G^(t)G(t). The one-slot drift of the Lyapunov function at slot t is 
given by 

L(G(i + l))-L(G(t)) 

= i(QT(t + l)Q(i + 1) - QT(t)Q(i)) + i(0T(t + i)0(t + 1) - 0T(t)0(i)) 

= - [(max{Q(t) - /x(t), 0} + R(t))^ (max{Q(t) - ^l{t),0} + R(t)) - Q"^(t)Q(t) 



1 

+ 2 



(max{0(t) + 7(t) - D(t), 0})"^ (max{0(t) + 7(t) - D(i), 0}) - 0"^(t)0(t) . 

(19) 
Note that for Q > 0, /x > 0, i? > 

(max{Q - //, 0} + Rf < Q^ + fi^ + R^ + 2Q{R - //), (20) 
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and for e > 0, 7 > and L> > 

(max{e + 7 - £>, 0}f < (G + 7 - D)^ = 9^ + (7 - Df + 26(7 - D). (21) 

Using inequalities (l20l ) and (|2TI) in (IT9l ). we have 

L(G(t + 1)) - L(G(t)) < ^/xT(t);x(t) + RT(t)R(t) + (R(t) - ;x(t))^ Q(t) 

+ i (7(t) - m)f (lit) - D(t)) + (7(t) - D(t))"^ e(t) (22) 

<B + (R(t) - mW)"" Q(i) + hit) - Bit))'' Git). 

where S is a uniform bound on the term i [/^"'"(t)/^(i) + R"'"(t)R(t)] + 1 (7(t) - D(t))"'" (7(t) - D(t)), 
that exists under the realistic assumption that the source coding rates, the channel coding rates and the 
video quality measures are upper bounded by some constants, independent of t. The conditional expected 
Lyapunov drift for slot t is defined by 

A(G(t)) := E [L(G(t + l))|G(i)] - L(G(t)) (23) 

Adding on both sides the penalty term —Vj^u&u'^ ['Pui7ui't))\Git)], we have 

A(G(t)) -V^E [^„(7„(t))|G(i)] <B-V^E [0„(7,(t))|G(t)] + E [(R(t) - /^(i))^ Q(t)|G(t) 



+ E[(7(t)-D(t))"^e(t)|G(t)' 



(24) 



Here, y > is a control parameter of the DPP policy that affects the utility-delay tradeoff. The DPP 
policy acquires information about G(t) and a;(t) at every slot t and chooses a(i) € A^n\, 7u(t) ^ u £ U 
to minimize the right hand side of the above inequality. Then, the resulting DPP policy is given by the 
minimization, at each chunk time t, of the function: 



R"^(t)Q(t)-D"^(t)0(t)^ - /^"^(t)Q(t) 

admission control 

transmission 



v^Miuit))-i''it)&it) 

. ueu 



(25) 



, , ,■ obi. maximization 

scheduling ■' 

The choice of R(t) and of ruuit) V u G ZY affects only the term R"'"(t)Q(t) — D"'"(t)0(t), the choice of 
/x(t) affects only the term /x"'"(t)Q(t), and the choice of 7(t) affects only the term V J2ueu 0«(7m(O) ~ 
7^(t)0(t). Thus, the overall minimization decomposes into three separate sub-problems. The first two 
sub-problems have a clear operational meaning, and will be referred to as admission control and trans- 
mission scheduling. Admission control consists of choosing rriuit) (quality indices for chunk requested 
at time t by user n) and the rates Rhuit) of source-coded bits requested by each user u from its 
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neighboring helpers h G N{u). Transmission scheduling consists of allocating the channel transmission 
rates fihu{t) for each helper h to its neighboring users u G J\f{h). The third sub-problem involves the 
greedy maximization of each user network utility function with respect to the auxiliary control variables 

7n(t). 

A. Admission Control 

The admission control sub-problem objective function (see (|25]) ) can be explicitly expressed as 



^l Yl kQhu{t)Rhu{t)-eu{t)DfAmu{t),t)\. (26) 

The minimization of this quantity decomposes into separate minimizations for each user, namely, for 
each u ^U, choose mu{t) and Rhu{t) V /i G J\f{u) n T-Lifu) to minimize 

Y^ kQhu{t)Rhu{t)-eu{t)Df^{mu{t),t) (27) 

with {Rhu{'t)}heAf(u)nH(U) satisfying ([1]). It is immediate to see that the solution consists of choosing the 
helper /i* G M{u) n ^{fu) with the smallest queue backlog Qhu{t), and assigning the entire requested 
chunk to /i* . Notice that in this way the streaming of the video file fu may be handled by different helpers 
across the streaming session, but each individual chunk is downloaded from a single helper. Further, the 
quality mode m„(t) is chosen as 

m„(t) = arg min {A;Q,,.„(t)S/Jm, t) - G„(t)L'/„(m, t)} . (28) 

m.e{l,...,Nf^} 

In order to implement this policy, it is sufficient that each user knows only its local information of the 
queue backlogs of its neighboring helpers. Congestion control decisions are decentralized: each user u 
chooses the helper h G J\f{u) n Tiifu) with minimum backlog, and requests the t-th chunk at a quality 
level according to (|28] |. This policy is reminiscent of the current adaptive streaming technology for 
video on demand systems, referred to as DASH (Dynamic Adaptive Streaming over HTTP) |[T5l . |[T6l . 
where the client (user) progressively fetches a video file by downloading successive chunks, and makes 
adaptive decisions on the source encoding quality based on its current knowledge of the congestion of 
the underlying server-client connection. 

B. Transmission Scheduling 

Transmission scheduling involves maximizing the weighted sum rate Xlhew SueAAf/i) Qhu{'t)l''hu{'t) 
where the weights are the queue backlogs (see (|25Tl). Under our system assumptions, this problem 
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decouples into separate maximizations for each helper. Notice that here, unlike conventional cellular 
systems, we do not assign a fixed set of users to each helper. In contrast, the helper-user association is 
dynamic, and results from the transmission scheduling decision itself. For each h ^ Ti, the transmission 
scheduling problem can be written as the Linear Program (LP): 

maximize ^ Qhu{t)f^hu{t) (29) 

uGAfih) 

subject to Y. tHI^I- (30) 

u£Af(h) 

By a change of variables Vhuit) = ^'"' \1 , the LP reduces to 

maximize ^ Qhu{t)Chu{t)j^hu{t) (31) 

subject to VJ I'huit) < 1- (32) 

uGAfih) 

The feasible region of the above LP is the |7V^(/i) [-simplex polytope and it is immediate to see that the 
solution consists of scheduling the user njlj G Af{h) with the largest product Qhu{t)Chu{t)^ and serve 
this user at rate fi^u* (t) = Chu' (0' while all other queues of helper h are not served in slot t. 

C. Greedy maximization of the network utility function 

Each user u gU keeps track of @u{t) and chooses its virtual queue arrival 7u(t) in order to solve: 

maximize V (pui^uit)) - Qu(t)ju(t) (33) 

subject to Df'' < 7„(t) < D^^^. (34) 

These decisions push the system to approach the maximum of the network utility function. By appropri- 
ately choosing the functions (pu, we can impose some desired notion of fairness. For example, a general 
class of concave functions suitable for this purpose is given by the a-faimess network utility, defined 

by 123 

log X a = 1 



(puix) = (Pa{x) 



(35) 
^ a > 0, a / 1 



In this case, it is well-known that a = yields the maximization of the sum quality (no fairness), a — > cxd 
yields the maximization of the worst-case quality (max-min fairness) and a = 1 yields the maximization 
of the geometric mean quality (proportional fairness). 
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IV. Performance Analysis: Arbitrarily Time Varying Network State 

As outlined in Section JIl VBR video encoding yields a time-varying quality-rate tradeoff that depends 
on the individual video file, and arbitrary user motion yields time variations of the path coefficients at 
the time-scale of the video streaming session. As a result, any stationarity or ergodicity assumption about 
the network state process uj{t) is unlikely to hold in most practically relevant settings. Therefore, in this 
section we consider performance guarantees of the DPP policy for any arbitrary sample path uj{t). We 
adopt the universal scheduling technique developed in 131 and |[T8l to address this scenario. We use a 
metric called the T-slot lookahead metric for evaluating the DPP policy. Time is split into frames of 
duration T slots and we consider F such frames. For any arbitrary sample path u){t), we consider the 
static optimization problem over the j'-th frame: 



ueu \ T=jT 



a+i)T-i 

max (/>j := > ^ </>« I 7^ Y] Du{t) I (36) 



^ (j+i)T-i 
subject to - ^ [kRhu (r) - rifihu (r)] < V (/i, n) € ^ (37) 

T=jT 

a{t) G A^(t) V t e {jT, . . . , (j + 1)T - 1} (38) 

We denote by (/)* the optimal solution to the above problem. Thus, the value (j)* is the maximum utility 
for frame j that can be achieved over all policies which have future knowledge of the sample path Lo{t) 
over the j-th frame, subject to the constraint <BT\ which ensures that for every queue Qhu^ the total 
service provided over the frame is at least as large as the total arrivals in that frame. As in Section |III] 
we consider the following equivalent problem which involves only time averages and introduces the 
auxiliary variables 7m (t): 

-. iJ+l)T-l 

"1^^ T T. Y.'t'uiluiT)) (39) 

T=jT ueU 

^ (J+1)T-1 

subject io - Y^ [kRhu (r) - nfihu (r)] <0y {h,u)e£ (40) 

T=jT 

. 0-+l)T-l 

^ Y. buir)-DuiT)]<oyueU (41) 

T=jT 

i^r" < iu{t) < Dr"" yueu,yte {jT, . . . , (j + i)r - 1} (42) 

a{t) e A^^t) V t E {jT, . . . , (j + l)r - 1}. (43) 
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The update equations for the transmission queues Qhu V (/i, u) £ £ and the virtual queues 0u V n G iY 
remain the same as dU and (|9]l respectively. Let G(t) = [Q^(t),0 (t)] be the combined queue 
backlogs column vector, and define the quadratic Lyapunov function 

L(G(t)) = ^GT(i)G(t). 

Fix a particular slot r in the j'-th frame. We first consider the 1-slot drift of L(G(t)). From (l22l ). we 
know that 

L(G(t + 1)) - L(G(r)) <B+ (R(t) - /^(r))^ Q(r) + (7(r) - D(t))T e(r) (44) 

where B is a uniform bound on the term i [tjJ{t)n{t) + R"^(t)R(t)] + i (7(4) - D(i))"'" {y{t) - D(i)), 
that exists under the realistic assumption that the source coding rates, the channel coding rates and the 
video quality measures are upper bounded by some constants, independent of t. We choose B such that 

B > 2k^k (45) 

where k is a vector whose components are all equal to the same number k and this number is a uniform 
upper bound on the maximum possible magnitude of drift in any of the queues (both actual and virtual) 
in one slot. 

With the additional penalty term —V Xl^gw 4'uilu{T)) added on both sides of (l44l) . we have the following 
drift plus penalty inequality: 

L(G(r + 1)) - L(G(r)) - F J] Mlu{r)) <B + (R(t) - /^(r))"^ Q(r) + (7(r) - D(t))"r 0(r) 

-VY,M1u{t)) (46) 

u&A 

Let {a(T)}^"L J. denote the DPP policy which minimizes the right hand side of the drift plus penalty 
inequality ( |46l ). Since it minimizes the expression on the RHS of ( |46l ). any other policy {a* {T)y^-j. ~ 
comprising of the decisions {m:(r)}[i+/^^-\ {R*(r)}[i+/^^-\ {M*(r)}HJ'/^^~' and {7*(r)}!.i7T^"' 
would give a larger value of the expression. We therefore have 



L(G(r + 1)) - L(G(t)) -VY, Mluir)) <B + (R*(r) - p*{T)f Q(r) + (7*(r) - D*(r))"^ 0(r) 

-^E'^«(^-(^))- (47) 



ueU 



U&J 



16 

Further, we note that the maximum change in the queue length vectors Qhu{T) and 0«(t) from one slot 
to the next is bounded by k. Thus, we have for all r G {jT, . . . ,{j + 1)T — 1} 

\Qhu{r) - QhuiJT)] < {t-jT)k V {h,u) E £ (48) 

\@u{r) - e„(jT)| < iT-jT)K yuGU (49) 

Substituting the above inequalities in (l47l) . we have 

L(G(r + 1)) - L(G(t)) -VY, Mluir)) <B + (R*(r) - ^l*{T))' (QiJT) + (r - jT)k) 

+ {-f*{T) - D*(r))"^ {©{jT) + (r - jT)^) 

Then, summing (l50l) over r G {jT, . . . , (j + 1)T — 1}, we obtain the T-slot Lyapunov drift over the j-th 
frame: 

jT+T-l 

L{G({j + 1)T)) - L{G{jT)) - V Y. E'^«(^«(^)) 

T=jT ueu 

^^^+(E!I^~'(I^*(^)-/^*(^))) Q(iT)+(^j;^^^"'(R*(r)-/.*(r))(r-jT)) /. 

-^E!!-r'5]'^'^(7:w) (51) 

Using the inequalities R*(r) - Ai*(T) < 2k, 7*(t) - D*(r) < 2k in dSTJ, we have 

jT+T-l 

L(G(0- + l)r))-L(G(jT))-y j; j;<^«(7.(r)) 



< 



ueu 
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Using k-'k < f , Eil^T~\^ - JT) = ^^, we get 

jT+T-l 

L{G{{3 + l)T))-L{G{jT))-V Y. Y.^-^^-^^)) 

T=jT ueu 

<BT + BT{T - 1) + (E!I^r' (^*(^) - ^*(^))) Q^^'^) 

We now consider the policy {a*(T)}^"L ■-/ satisfying the following constraints: 

^ {i+i)T-i 



^ (i+i)r-i 

^ E [7:(T)-I?:(T)]<-eVnGW 



T 

r=jT 



(54) 



(55) 



where e > is arbitrary. We plug in the inequalities (|54] |. (l55i in (l53T l and obtain 

jT+T-l 

L(G((j + i)r)) - L(G(jT)) - y E E'^«(^«(^)) 

<BT^-eT E Q/.„(iT)-6rE0Jjr)-^E'I^T 'l^-^^^^^^)) (56) 

{h,u)es u&A ueu 

Also, considering the fact that for any vector 7 = (71, ... , 7|2^|) we have 

E MDfn = '/'min < E '^«(^«) ^ -^-^ = E MDn, (57) 

•ugw ugw uew 

we can write: 

L(G((j + l)r)) - L{G{jT)) < BT^ + yr(0^ax - 0min) - e^ E QhuiJT) - e^ E 0«(iT) 

{h,u)e£ u&U 

(58) 
Once again using (1481 ). (l49l ). we have: 

jT+T-l 

L(G((i + l)T))-L(G(jT))<i3r2 + yr(,^,,,,-0,,iJ-e ^ J] Q;,„(r) 
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Summing the above over the frames j € {0, . . . , F — 1} yields 

L(G((FT)) - L(G(0)) < BT^F + VFT{ct>ra.. - </'mm) - e J] Yl Qhu{r) 

T=o (h,u)es 

Rearranging and neglecting appropriate terms, we get 

^ ^sr \- n ( \^ ^ ^srs-c^ ( ^ ^ sr y(0^3^^^>^ L(G(o)2 

FT 1^ 1^ Qhuir) + pf l^X. ®«(^) < — + 1 + eFT 

T=o (h,u)e£ T=o ueu 



(60) 



^ K(|g| + |^|)(T-1) ^^^^ 



Taking limits as F — ;• oo 



1 ^z^i / \ i?r , y(c/>^ax - 0min) , K(|g[ + |z^|)(r - 1) 
i™^;^ Z^ Z^ Qhuir) + }_^QuiT)] < — + + 

r=0 \{h,u)£e ueu 



(62) 



,{i+l)T-l 



We now consider the policy {a*{T)y^^-rf which achieves the optimal solution (j)* to the problem 
dUll - (|43]). Using (gOll and dU) in dSS]), we have 

jT+T-l 

L{G{iJ + l)T))-L{G{jT))-V Y, J]</.„(7jr))<Br + i?r(r-l)-yr0} (63) 

T=jT ueu 

Summing this over j € {0, . . . , F — 1}, yields 

FT~1 F-1 

L{G{{FT)) - L(G(0)) -V YY. Mlu{r)) < BT^F -VT^ 4>}- (64) 

T=0 ueU r=0 

Dividing both sides by VFT and using the fact that L(G((FT)) > , we get 

FT~1 , F-1 



T=o ueu /=o 



(65) 



At this point, using Jensen's inequality, the fact that ipui-) is continuous and non-decreasing for all 
u gU, and the fact that the strong stability of the queues ( [62l ) implies that Huif-^oo -^ Y1t=o ©«(■'") < 
oo V n G ZY, which in turns implies that 7„ < Z?„ \/ u gU, we arrive at 



F-l 



ueu /=o 



(66) 



Thus, the utility under the DPP policy is within 0{1/V) of the time average of the (j)* values which 
can be achieved only if knowledge of the future states up to a look-ahead of blocks of T slots. If T is 
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increased, then the value of 0* for every frame j improves since we allow a larger look-ahead. However, 
from (l66l ). we can see that if T is increased, then V also has to be increased to maintain the same 
distance from optimality which in turn would prompt an 0{V) increase in the queue backlog. Similar 
performance guarantees for the case when uj{t) is i.i.d can be recovered as a special case of the above 
analysis and are omitted. 

V. Pre-buffering, re-buffering and skipping chunks 

As described in Section H the playback process consumes chunks at fixed playback rate (one chunk 
per time slot) while the number of ordered chunks per unit time entering the playback buffer is a random 
variable, due to the fact that the transmission resources are dynamically allocated by the scheduling policy, 
and that the network state uj{t) may be time-varying. In addition, it may happen that chunks which go 
through different queues in the network are affected by different delays. This may give rise to a situation 
where already received chunks with higher order number cannot be used for playback until the missing 
chunks with lower order number are also received. An example is illustrated in Table U and Figure [T] The 
table indicates the chunk numbers and their respective arrival times. The blue curve in Figure [T] shows 
the evolution with time of the number of ordered chunks which are available and are ready for playback. 
On the other hand, the green curve indicates the evolution with time of the number of chunks consumed 
in playback. The playback consumption starts after an initial pre-buffering or startup delay d as indicated 
in the figure. At any instant t, the chunk requested at t — d is expected to be available in the playback 
buffer. However, if the chunk is delivered with a delay greater than d, the two curves meet and there is an 
interruption/stall event in the playback process. Hence, in order to prevent the stall event, a user would 
ideally like to choose d to be larger than the maximum delay in delivering the chunks. Unfortunately, 
such maximum delay is neither deterministic nor known a priori. In fact, it depends on the dynamics 
of the network state uj{t) (including user mobility, streaming activity and the non-ergodic non-stationary 
dynamics of the video VBR coding, as discussed in Section ^. Hence, this maximum delay must be 
estimated adaptively and then the initial pre-buffering time and the re-buffering time in the case of a stall 
event must be chosen such that the number of ordered chunks in the buffer is sufficiently larger than 
the observed maximum delay. Every user u can estimate its local delays by monitoring its requests and 
delivery time in a sliding window spanning a fixed number of time slots in the past. Finally, a user can 
also skip a chunk if by doing so, it can provide a large jump-up in the number of ordered chunks. For 
instance, in Table U and Figure [I] the chunk which comes 4**^ in the ordered sequence arrives at the end 
of time slot 11. However, chunks numbered 5, 6, 7 and 8 aixive before slot 11 but cannot be played since 
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Fig. 1 ; Evolution of number of ordered and consumed chunks 

4 is missing. More generally, if chunk 4 were to arrive with a delay such that the number of chunks 
which arrive before 4 but come later in the ordered sequence becomes large, then the user could either 
continue waiting for the missing chunk and incur a stall event or it could be worthwhile to skip chunk 
4 from playback and provide a large jump-up in the buffer. 

We now describe formally the proposed policy for pre-buffering, re-buffering and chunk skipping that 
each user u implements locally. Let T^ denote the time slot in which a user requests the k^^ chunk and 
let Ak be the time slot in which the chunk arrives at the user playback buffer The delay for chunk k 
is Wk = Ak — Tfc. Without loss of generality, consider a user u starting its streaming session at time 
t = 1. In the proposed scheduling policy, user u requests one chunk per scheduling time, sequentially and 
possibly from different helpers, such that Ti = 1, T2 = 2, and so on. Since the chunks are downloaded 
from different neighboring helpers with different queue lengths and downlink channel capacities, they 
may be received out of order. For example, it may happen that Af^ < Aj for some j < k. Hence, chunk k 
cannot be played until all chunks j for j < k are also received. We say that a chunk k becomes playable 
when all the chunks j < k are received. Let P^ denote the time when chunk k becomes playable. Then, 
we have: 



Pk = ma.x{Ai,A2, ..., A^}. 



(67) 
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The proposed policy consists of two parts: skipping chunks from playback and buffering policy. We 
examine these two features separately in the following. 

A. Skipping chunks from playback 

Prior to slot t, the set of playable chunks is {A; : P^ < t — 1} and 

kl_^ = max{fc : Pfc < t - 1} (68) 

is the highest-order chunk in the ordered sequence of playable chunks. At the end of slot t, user u 
considers the set Ct of all chunks which have arrived before or during slot t and which come later than 
kl_i in the ordered sequence of playback. The set Ct is given by: 

Ct = {k:k> kl_^, Ak < t}. (69) 

The next available chunk with order larger than A;*_^ is given by: 

fcj" = minjA; : k > k^^^, Ak < t}. (70) 

Let C* C Ct be the set of chunks which become playable at the end of slot t, i.e., 

C: = {k:Ak<t,Pk = t}. (71) 

If k^ comes next to k*_-^ in the playback order (i.e. if k^ = k*_^ + 1), then C* is non-empty and all 
the chunks k E Q* can be added to the playback buffer. Further, kt is recursively updated as: 

k; = kl, + \Cn. (72) 

Denoting the increment in the size of the playback buffer at the end of slot t hy At, we have in this 
case that Aj = |Cj*|. On the other hand, if k^ is not the immediate successor of k^_-^ in the playback 
order (i.e. if k^ > k^_i + 1), then there is no chunk in Ct which becomes playable at the end of slot 
t and therefore Ct = 0. In this case, the algorithm compares \Ct\ with a threshold p in order to decide 
whether it should wait further for the missing chunk k*_^ + 1 or skip it in order to increase the playback 
buffer anyway. As explained in the qualitative discussion at the beginning of Section |Vl the intuition 
behind such a decision is that it is worthwhile to skip a chunk if skipping such a chunk results in a large 
jump in the playback buffer size. The size of this possible jump can be exactly computed from Ct as 
follows: assuming k^ = k^^^ + 2, if we skip chunk k'l_^ + 1, then the increase in the playback buffer is 
given by the size of the set: 

{j > 2 : k;_^+ieCty2<i< j}. 
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We therefore propose the following policy: if \Ct\ < p (where p is a parameter > 0), then wait for 
chunk k^_i + l and let k^ = kf_i. Otherwise, if \Ct\ > p, the increase of the playback buffer is worthwhile 
and therefore it is useful to skip chunk kl_i + 1. In this case, if k^ = kl_i + 2, then A;^* is updated as 

kl = kl_^ + max{j : A;;„i + i V 2 < i < j} (73) 

and all the chunks numbered from kl_i + 2 to k^ are made playable at the end of slot t and added to 
the playback buffer. We therefore have Aj = |{j > 2 : fc^„^ + iV2<i<j}|in this case. Instead, if 
A;^" > fc^_^ + 2, then the user skips chunk k*_^ + 1 and starts waiting for chunk k*_^ + 2. Only a single 
chunk is allowed to be skipped per slot because skipping multiple chunks might cause damage to the 
quality of experience of the user. Note that in this case, k^ is updated to fc^„^ + 1 even though the chunk 
kl_^ + 1 is missing and is not playable. This is to ensure that when chunk fc^_^ + 2 is received, it is 
considered playable despite the fact that chunk kl_i + 1 is missing. Also note that there is no increment 
in the playback buffer (i.e., Aj = 0) in this case because there is no new chunk which becomes playable. 
Note that choosing p = oo corresponds to the case when no chunk is skipped. 

B. Pre-bujfering and re-buffering 

The goal here is to determine the delay T„ after which user u should start playback, with respect to 
the time at which the first chunk is requested (beginning of the streaming session). Intuitively, choosing 
a large T^ makes the probability of stall events small. However, a too large T„ is very annoying for the 
user's quality of experience. From the chunk skipping strategy seen above, we know that At is the number 
of new chunks added to the playback buffer at the end of slot t. We define the size of the playback buffer 
^t as the number of playable chunks in the buffer not yet played. Without loss of generality, assume 
again that the streaming session starts at i = 1. Then, ^t is recursively given by the updating equationjj 

^t = max{^t_i-l{t>T„},0} + Aj. (74) 

From the qualitative discussion on the evolution of the playback buffer at the beginning of Section |V] 
we notice that the longest period during which ^^ is not incremented (in the absence of chunk skipping 
decisions) is given by the maximum delay Wk to deliver chunks. In addition, we note that each user u 
needs to adaptively estimate W^ in order to choose T^. In the proposed method, user u calculates for 
every chunk k the corresponding delay Wk = A^ — Tk. Notice that the delay of chunk k, can be calculated 
only at time Ak, i.e., when the chunk is actually delivered. At each time t = 1, 2, . . ., user u calculates 

'l{/C} denotes the indicator function of a condition or event /C. 
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the maximum observed delay Et in a sliding window of size A, (in all the numerical experiments in the 
sequel, we use A = 10) by letting: 

Et = m.a^{Wk : t- A + 1 <Ak<t}. (75) 

Finally, user u starts its playback when ^t crosses the level £,Et, i.e., 

T„ = min{t: ^t>^Et}. (76) 

If a stall event occurs at time t, i.e., ^t = for t > T„, the algorithm enters a re-buffering phase in 
which the same algorithm presented above is employed again to determine the new instant t + Tu + l at 
which playback is restarted. Notice that even if here, with some abuse of notation, we have denoted the 
re-buffering delay again by Tu, in fact this is re-estimated using the sliding window method at every new 
stall event. In fact, if a stall event occurs it is likely that some change in the network state has occurred, 
such that the maximum delay must be re-estimated. 

VI. Numerical Experiments 

We consider a 400m x 400m square area divided into 5x5 small square cells of side length 80m as 
shown in Figure |2] A helper is located at the center of each small square cell. Each helper serves only 
those users within a radius of 60 m. As described in Section U the helpers could be connected to some 
video content delivery network through a wired backbone or they could be dedicated nodes with local 
caching capacity. In these simulations we assume that each helper has available the whole video library. 
Therefore, for any request /„ we have J\f{u) n'H{fu) = M{u). We further assume that there are 2 users 
uniformly and independently distributed in each small cell. We use the utility function 0u(x) = log(x) 
for all u gU. We assume a physical layer inspired by LTE specifications ll26l . As described in Section UIl 
with a scheduling slot duration of 0.5s and a total available system bandwidth of 18 MHz, we have 10^ 
LTE resource blocks. Thus, the total number of channel symbols n available in a scheduling slot is 
10^ X 84. 

Between any two points a and b in the square area, the path loss is given by g{a, b) = — . ^(\^)x„ where 
(5 = 40 m and a = 3.5. Each helper transmits at a power level such that the SNR per symbol (without 
interference) at the center (i.e., at distance d{a, b) = from the transmitter) is 20 dB. The point-to-point er- 
godic capacities in ([3]) are calculated by using the approximate formula: Chu{t) = e^/^''"(*)Ei ( 1, p ^, . j 
where Ei(l, x) = J^ ^dt for x > 0, and Thu{t) = -^ , ^^ hgh^{) ^ jj^j^ formula is easy to derive 
and is known to be accurate. 
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Fig. 2: Toplogy (the green line indicates tlie trajectory of a mobile user in Experiment 2). 



We assume that all the users request chunks successively from VBR-encoded video sequences. Each 
video file is a long sequence of chunks, each of duration 0.5 seconds and with a frame rate of 30 frames 
per second. We consider a specific video sequence formed by 800 chunks, constructed using 4 video 
clips from the database in [27], each of length 200 chunks. The chunks are encoded into different quality 
modes. Here, the quality index is measured using the Structural SIMilarity (SSIM) index defined in 
ll28l . Figures l3al and l3b] show the size in kbits and the SSIM values as a function of the chunk index, 
respectively, for the different quality modes. In our experiments, the chunks from 1 to 200 and 601 to 
800 are encoded into 8 quality modes, while the chunks numbered from 201 to 600 are encoded in 4 
quality modes. In all the numerical experiments in the sequel, each user starts its streaming session of 
1000 chunks from some arbitrary position in this reference video sequence and successively requests 
1000 chunks by cycling through the sequence. 



A. Experiment 1 

At t = 0, all users start their streaming session as explained before, for the duration of 1000 chunks. 
We identify the following five key performance metrics: 1) The percentage of skipped chunks due to 
large delays; 2) The initial pre-buffering time (in number of slots) or the startup delay Tu before starting 
playback; 3) The number of times playback gets interrupted and enters re-buffering mode (frequency of 
stall events per streaming session); 4) The percentage of total playback time which is spent in re-buffering 
mode; 5) The quality (SSIM) averaged over the delivered (i.e., the chunks which are not skipped) chunks. 

The parameters governing the DPP policy and skipping/buffering policy are V, ^ and p. We have studied 
the effect of each individual parameter while keeping the other two fixed to some reasonable value, in 
terms of the above key performance metrics. Figure @a]shows the cumulative distribution function (CDF) 
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(a) bitrate profile (b) Quality profile 

Fig. 3: Rate-quality profile of video sequence. 



of quality (averaged over delivered chunks) over the user population for the values 2V, 4V, 6V, 8V and 
lOF with V = 10^^, for fixed p = 20 and ^ = 10. We can notice the fairness in service as the policy 
achieves a value close to the optimum for large V. 

Figures |4b] - |4d] show CDFs of the pre-buffering and re-buffering quantities for different values of 
^, for fixed V = 10^'^ and p = 20. We can observe from these figures that the parameter ^ affects the 
tradeoff between the pre-buffering time and the re-buffering quantities, namely, the number of re-buffering 
periods and the percentage of playback time spent in re-buffering mode. From the buffering policy, we 
can see that a larger ^ sets a larger threshold for starting playback resulting in larger pre-buffering times 
as reflected in Figure |4b] However, with larger pre-buffering times, we have lesser number of re-buffering 
periods and the playback spends a lesser fraction of the time in re-buffering mode, as reflected in Figures 
|4c]and|4dl 

Finally, Figures |4e] - |4g| show analogous CDFs for different values of p with V = 10^^ and ^ = 25. 



We can observe that p affects the tradeoff between the percentage of skipped chunks and the buffering 
metrics. From the buffering policy, we can see that if we choose a larger p, then the buffer waits longer for 
undelivered chunks and accommodates chunks with larger delays. As a result, the percentage of chunks 
which are skipped decreases, as seen in Figure @e] Further, by waiting longer for chunks with large 
delays, the buffering policy allows longer pre-buffering and re-buffering times, as evidenced in Figures 



and 4g Note that when p is set to oo, no chunk is skipped as seen in Figure |4e] Thus, p can be chosen 



to be equal to oo if one wishes to avoid skipping any chunks altogether. However, the chunk-skipping 
policy provides every user u the flexibility to skip a small percentage of chunks in order to improve the 



26 

Streaming performance in terms of the pre-buffering and re- buffering performance metrics. In fact, as 
is evidenced in Figures |4f| and |4gj only a small percentage of outlier chunks have large delays and by 
skipping them, there is a drastic improvement in the pre-buffering and re-buffering times. 

B. Experiment 2 

In this case the setup is the same as in Experiment 1 but now we consider a specific user, indicated by 
ui, moving slowly across the square grid along the green path indicated in Figure dU, during the 1000 
slots of simulation. We fix the parameters V = 10^"^, ^ = 25 and p = 50. For the chosen parameters, we 
observe that the percentage of chunks which are skipped by the mobile user is 1.3% and the pre-buffering 
time is 162 time slots. Furthermore, from Figure |5b] showing the evolution of the playback buffer ^t 
over time, we notice that there are no interruptions and the playback never enters the re-buffering mode. 
The helpers are numbered from 1 to 25, left to right and bottom to top, in Figure |2] In Figure [Sa] we 
plot the helper index providing chunk k = 1, . . . , 1000 vs. the chunk index. We can observe that as 
the user moves slowly along the path, the DPP policy "discovers" adaptively the current neighboring 
helpers and downloads chunks from them in a seamless fashion. Overall, these results are indicative 
of the dynamic and adaptive nature of the DPP policy in response to arbitrary variations of large-scale 
pathloss coefficients due to mobility. 

C. Experiment 3 

We assume the same system setup as the previous experiments but now simulate the situation where 
users alternate between idle and active phases. Initially at t = 0, all users are idle and each user 
independently starts a streaming session with probability p = 0.005 at every slot. Thus, the time for which 
a user stays idle is geometrically distributed with mean - = 200 slots. Once a user starts a streaming 
session, it stays active during 1000 video chunks. After finishing the requests, it goes back into the idle 
state and may start a new session after a random geometrically distributed idle time. We run the simulation 
for 3000 slots for different values of the key parameters V, ^ and p. We evaluate the performance using 
the same metrics as in Experiment 1 but with the following differences: 1) The percentage of skipped 
chunks and the quality (SSIM) averaged over the delivered chunks are now calculated with respect to 
the total number of chunks spanning multiple streaming sessions of the user; 2) The initial pre-buffering 
time (in number of slots) is calculated for each streaming session and the CDF is plotted over all the 
sessions; 3) The number of times playback gets interrupted and enters re-buffering mode is normalized 
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by the number of streaming sessions of the user; 4) The percentage of time spent in re-buffering mode 
is calculated with respect to the total playback time spanning multiple streaming sessions of the user; 

We note that the results are very similar to those observed in Experiment 1. For instance, Figure |5c] 
shows a similar fairness in service as V is increased and Figures |5d] and |5e] show how the parameter ^ 
affects a similar tradeoff between the pre-buffering time and the frequency of re-buffering or stall events 



per streaming session. Finally, Figures |5fl and 5g show how the parameter p affects a similar tradeoff 
between the percentage of skipped chunks and the percentage of playback time spent in re-buffering 
mode. 
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Fig. 5: (a) and (b) coiTespond to Experiment 2. (c)-(g) indicate the CDF's of different performance metrics for Experiment 3. 



